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Abstract 


We establish a new bridge between propositional logic and elementary 
number theory. A full clause in a conjunctive normal form (CNF) contains all 
variables, and we study them in minimally unsatisfiable clause-sets (MU); such 
clauses are strong structural anchors, when combined with other restrictions. 
Counting the maximal number of full clauses for a given deficiency k, we 
obtain a close connection to the so-called “Smarandache primitive number” 
$2(k), the smallest n such that 2* divides n!. 

The deficiency k > 1 of an MU is the difference between the number of 
clauses and the number of variables. We also consider the subclass UHIT of 
MU given by unsatisfiable hitting clause-sets, where every two clauses clash. 
While MU corresponds to irredundant (minimal) covers of the boolean hyper- 
cube {0,1}” by sub-cubes, for UHIT the covers must indeed be partitions. 

We study the four fundamental quantities FCH, FCM, VDH, VDM: N > 
N, defined as the maximum number of full clauses in UHIT resp. MU, resp. the 
maximal minimal number of occurrences of a variable (the variable degree) 
in UHIT resp. MU, in dependency on the deficiency. We have the relations 
FCH(k) < FCM(k) < VDM(k) and FCH(k) < VDH(k) < VDM(k), together 
with VDM(k) < nM(k) < k+1+log,(k), for the “non-Mersenne numbers” 
nM(k), enumerating the natural numbers except numbers of the form 2” — 1. 

We show the lower bound S2(k) < FCH(k); indeed we conjecture this 
to be exact. The proof rests on two methods: Applying an expansion pro- 
cess, fundamental since the days of Boole, and analysing certain recursions, 
combining an application-specific recursion with a recursion from the field of 
meta-Fibonacci sequences. 

The S2-lower bound together with the nM-upper-bound yields a good han- 
dle on the four fundamental quantities, especially for those k with S2(k) = 
nM(k) (we show there are infinitely many such k), since then the four quanti- 
ties must all be equal to S2(k) = nM(k). With the help of this we determine 
them for 1 <k < 13. 
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1 Introduction 


We study combinatorial parameters of conjunctive normal forms (CNF's) F’, con- 
junctions of disjunctions of literals, under the viewpoint of extremal combinatorics: 
We maximise the number of “full clauses” in F' for a given “deficiency” 5(F’), where 
not all F are considered (that number would not be bounded), but only “minimally 
unsatisfiable” F’. We use exact methods, establishing links to elementary number 
theory and to the theory of special recursions. 

To help the reader, we give now the definitions, in a somewhat unusual way, 
which is nevertheless fully precise. CNFs as combinatorial objects are “clause- 
sets”, where for this introduction we just use natural numbers (positive integers) 
as logical “variables”. More precisely, we consider non-zero integers as literals x 
with arithmetical negation —a the logical negation, while clauses are finite sets 
C of Literals (non-zero integers), such that for « € C we don’t have —x € C 
(logically speaking, C must not be tautological), and clause-sets F' are finite sets 
of clauses. The set var(F’) of variables of F is the set of absolute values of literals 
occurring in F’. A full clauses C € F is a clause of maximal possible length, that 
is, of length |var(F’)|, in other words, all variables must occur in C' (negated or 
unnegated); the number of full clauses of F’ is denoted by fc(F’). A clause-set F 
is satisfiable iff there exists a clause C (which represents the set of “literals set 
to true”), which intersects all clauses of F' (note that this is non-trivial, since C 
must not contain complementary literals « and —), otherwise F is unsatisfiable. 


Moreover, unsatisfiable F’, where removal of any clause makes them satisfiable, are 
called minimally unsatisfiable, while the set of all of them is denoted by MU. The 
main parameter is the deficiency 6(F) = c(F) — n(F) € Z, where c(F) := |F| is 
the number of clauses, and n(£’) := |var(F’)| is the number of variables. The most 
basic result of the field, “Tarsi’s Lemma” ({I]), states 6(F') > 1 for F € MU. An 
example of an unsatisfiable clause-set is {{—1}, {1}, {1,2}}, which is not minimal, 
but Fy := {{-1}, {1}} e MY, with 6(F,) = 2-—1=1 and fe(F,) = 2. An example 
of F € MU with fc(F’) = 0 is Fy := {{-1, 2}, {—2, 3}, {-3, 1}, {1, 2}, {-2, -3}}, 
where 6(F)) = 2. Indeed we mainly concentrate on a subset of MU, namely 
UHTT Cc MU, the unsatisfiable hitting clause-sets, given by those F € MU such 
that for each C,D € F, C # D, there is a “clash”, that is, there is x € C with 
—a € D. We have F, © UHTT and Fh ¢ UHTT; the latter can be “repaired” with 
Fs := {{-1, 2}, {2,3}, {-3, 1}, {1, 2, 3}, {-1, —2, -3}} € UHTT (still 6(F3) = 2, 
but now fc(F3) = 2). 

Now we denote by FCM(k) the maximum of fe(F’) for F € MU with 6(F) =k 
(short: F € MUs=;,). From Theorem 15] follows the upper bound FCM(k) < 
nM(k) for the non-Mersenne numbers nM(k) € N, with k+ |log,(k+1)| <nM(k) < 
k+14+ |log(k)| ({23) Corollary 10}). Until now no general lower bound on FCM(k) 
was known, and we establish S2(k) < FCM(k). Here S2(k), as introduced in [30], 
is the smallest n € No such that 2” divides n!, and various number-theoretical 
results on Sz and the generalisation S, for prime numbers p are known. Actually 
we show a stronger lower bound, namely we do not consider all F € MUs x, 
but only those F € UHTIT, yielding FCH(k) with FCH(k) < FCM(k), and we show 
So < FCH. The elements of “HIT are known in the DNF language as “orthogonal” 
or “disjoint” tautological DNF, and when considering arbitrary boolean functions, 
then also “disjoint sums of products” (DSOP) or “disjoint cube representations” 
are used; see Section 4.4] or [6] Chapter 7]. 


1.1 Background 


The central underlying research question is the programme of classification of MU 
in the deficiency, that is, the characterisation of the layers MU; =, fork € N. A 
special case of the general classification is the classification of UHI T5—,. The earliest 
source [I] showed (in modern notation) 6(F) > 1 for F € MU, and characterised 
the special case SMUs—1 C MUs—1, where SMU C MU contains those F € MU 
such that no literals can be added to any clauses without destroying unsatisfiability. 
Later characterised MUs5~1 via matrices, while the intuitive characterisation via 
binary trees was given in Appendix B], where also SMUs=1 = UHTTs=1 has 
been noted. In the form of “S-matrices”, the class MUs;—, had been characterised 
earlier in [13], going back to a conjecture on Qualitative Economics ({9]), and 
where the connections to this field of matrix analysis, called “Qualitative Matrix 
Analysis (QMA)”, were first revealed in (see Subsection 11.12.1] and 
Subsection 1.6.4] for overviews). Another proof of 6(F’) > 1 for F € MU is obtained 
as a special case of [2] Corollary 4], as pointed out in [3]. 

SMUs-=2 and partially MUs=2 were characterised in [16], with further informa- 
tion on MUs=2 in [24]. showed that all layers MUs—; are poly-time decidable. 

A key element for these investigations into the structure of MU is the min-var- 
degree pvd(F) := minyevar(r)|{C € F : {-v,v} NC F 0}, the minimal variable- 
degree of F’, and its maximum VDM(k) over all F € MUs=x. Indeed the key to 
the characterisation of MU; 1 in [7] as well as in was the proof of VDM(1) = 
2. The first general upper bound Vk € N : VDM(k) < 2k was shown in 
Lemma C.2]. Now in [23], mentioned above, we actually showed the upper bound 
VDM(k) < nM(k). Using fc(F’) for the number of full clauses in F’, obviously 
fc(F) < pvd(F) holds. FCM(k) is the maximum of fc(F’) over all F € MUsex, 


thus FCM(k) < VDM(k). 

In [25] Section 14] we improve the upper bound to VDM < nMj,, based on two 
results: VDM(6) = nM(6) — 1 = 8, and a recursion scheme, transporting this im- 
provement to higher deficiencies, obtaining nM, from nM, where for infinitely many 
k holds nM, (k) = nM(k) —1. The proof of VDM(6) = 8 contains an application of 
full clauses, namely we use FCM(3) = 4. 

For the variation VDH(k) < VDM(k), which only considers hitting clause- 
sets, we conjecture VDH(k) = VDM(k) for all k > 1. Furthermore we conjecture 
FCM(k) > nM(k) — 1, and thus the quantities nM(k), VDM(k), VDH(k), FCM(k) 
are believed to have at most a distance of 1 to each other. On the other hand we 
conjecture FCH(k) = S2, where S2(k) oscillates between the linear function k& + 1 
and the quasi-linear function & + 1+ |log,(k)|. Altogether the “four fundamental 
quantities” FCH,FCM, VDH, VDM seem fascinating and important structural pa- 
rameters, whose study continues to reveal new and surprising aspects of MU and 
UHTT; see Section [8] for some final remarks. 

It is also possible to go beyond MU: in Section 9] it is shown that when 
considering the maximum of pvd(F'’) over all F € LEANs=~ D MUs=rx, the set 
of all “lean” clause-sets, that then nM(k) is the precise maximum for all k > 1. 
Lean clause-sets were introduced in [I9] as the clause-sets where it is not possible to 
satisfy some clauses while not touching the other clauses, and indeed were already 
introduced earlier, as “non-weakly satisfiable formulas (matrices)” in the field of 
QMA by [14]. Furthermore it is shown in [25] Section 10], that there is a polytime 
“autarky reduction” , removing some clauses which can be satisfied without touching 
the other clauses, which establishes for arbitrary clause-sets F' the upper bound 
pvd(F) < nM(d(F)); an interesting open question here is to find the witnessing 
autarky in polynomial time. 


1.2. The lower bound 


Back to the main result of this report, the proof of Sz < FCH is non-trivial. Indeed 
the proof is relatively easy for a function $5(k) defined by an appropriate recursion, 
motivated by employing “full subsumption extension” C'~» CU {v}, CU {TU} in an 
optimal way. Then the main auxiliary result is $4 = Sj. For that we use another 
function, namely a2(k) as considered in [26] in a more general form, while az was 
introduced with a small modification in [5]. These considerations belong to the field 
of meta-Fibonacci sequences, where special nested recursions are studied, initiated 
by {10} Page 145]. Via a combinatorial argument we derive such a nested recursion 
from the course-of-value recursion for $5, which yields $4 = 2a,. We also show 
2a2 = Sq (this equality was conjectured on the OEIS [29]), and we obtain S$ = Sb. 

We obtain the inequality Sy < nM, with the four fundamental quantities sand- 
wiched inbetween. The deficiencies k where equality holds are collected in the set 
SNM, which we show has infinitely many elements. For the elements k of SVM, 
the four fundamental quantities coincide with S2(k) = nM(k), which yields islands 
of precise knowledge about the four quantities. We apply this knowledge to deter- 
mine the four quantities for 1 <k < 18. 


1.3. Overview on results 


The main results of this report are as follows. Theorem [8.16] proves Sz = 2ag. 
Theorem [4.15] shows a meta-Fibonacci recursion for $4, where $% is introduced by 
a recursion directly related to our application. Theorem |[417]then proves S$ = So. 
After these number-theoretic preparations, we consider subsumption resolution and 
its inversion (extension); Theorem [5.5] combines subsumption extension and the 
recursion machinery, and shows Sg < FCH. In the remainder of the report, this 


fundamental result is applied. Theorem [6.1] proves a tight upper bound on S%, 
while Theorem[6.4] considers the cases where the lower bound via $2 and the upper 
bound via nM coincides. Finally in Theorem[/.3]we determine the four fundamental 
quantities for 1 < k < 13 (see Table[I). 


2 Preliminaries 


We use Z for the set of integers, No := {n € Z: n > 0}, and N := No \ {0}. For 
maps f,g: X > Zwe write f<gifVxaEX: f(x) < g(a). 

On the set LIT of “literals” we have complementation z € LIT # % € LIT, 
with #4 a and F = x. We assume Z \ {0} C LIT, with 7 = —z for z € Z \ {0}. 
“Variables” VA C LIT with N C VA are special literals, and the underlying variable 
of a literal is given by var : CLIT — VA, such that for v € VA holds var(v) = 
var(U) = v, while for  € LIT \ VA holds © = var(x). For a set L C LIT we 
define L := {%: x2 € L}. A clause is a finite set C of literals with CNC = 0 (C is 
clash-free). A clause-set is a finite set of clauses, the set of all clause-sets is CLS. 

For a clause C' we define var(C) := {var(a) : 2 € C} C VA, and for a clause-set F’ 
we define var(F’) := Ucep var(C) C VA. We use the measure n(F’) := |var(F)| € No 
and c(F’) := |F'| € No, while the deficiency is 6(F’) := c(F’) — n(F) € Z. 

The set of satisfiable clause-sets is denoted by SAT C CLS, which is the set of 
clause-sets F’ such that there is a clause C’ which intersects all clauses of F, i.e., 
with VDE F:CAD #9; the unsatisfiable clause-sets are USAT := CLS \ SAT. 

The set MU C USAT of minimally unsatisfiable clause-sets is the set of F’ € 
USAT, such that for F’ C F holds F’ € SAT. The unsatisfiable hitting clause-sets 
are given by UHZT := {F CUSAT |\VC,DE F,C#D:CNDF 9}. It is easy to 
see that UHIT C MU holds, and that for all F € UHIT holds o¢p,27!¢! = 1. 
While all definitions are given in this report, for some more background see [17]. 


2.1 Full clauses 


A full clause for F € CLS is some C € F with var(C) = var(F’) (equivalently, 
|C| = n(F)), and the number of full clauses is counted by fe : CLS + No, which 
can be defined as fe(F’) := c(F'M A(var(F))), and where A(V) € UHIT for some 
finite V Cc VA is the set of all clauses C with var(C) = V. Standardised versions of 
the A(V) are Ay, := A({1,...,}) for n © No. 


Example 2.1 In general n(A,) =n, c(An) = 2” and 6(A,) = 2"—n. Initial cases 
are Ag = {1}, A, = lh tay and Ag = 6 Gat =2}, {=1, 2}, tL, —2}; {1,2}}. 


The following observation is contained in the proof of [33] Utterly Trivial Ob- 
servation]: 


Lemma 2.2 For F ©eUHTT, F 4 {1}, the number fc(F) of full clauses is even. 


Proof: Let n := n(F). We have Voep gr-ICl — 2”, and thus dcer gr-lGl ig 
even (due to n > 0). Since Vioericyyn 2”-ICl is even, the assertion follows. 


2.2 The four fundamental quantities 


For F' € CLS we define the var-degree as vdr(v) := c({C € F': uv € var(C)}) € No 
for v € VA, while in case of var(F’) 4 0 (i.e., F ¢ {T, {L}}) we define the min-var- 
degree pvd(F) := minyeyar(r) Vdr(v) € N. 


Definition 2.3 For k € N let 
e FCH(k) €N be the maximal fce(F) for F © UHTT 5x; 
e FCM(k) EN be the maximal fce(F) for F € MUsex; 
e VDH(k) €N be the maximal pvd(F’) for F € UHTTs=x; 
e VDM(k) EN be the maximal pvd(F) for F € Mus=x. 
For k =1 the case F = {1} is excluded in the last two definitions. 


By [23] Lemma 9, Corollary 10, Theorem 15]: 


Theorem 2.4 ((23]) VDM(k) < nM(k) = k + |logs(k + 1+ |log,(k + 1)])| < 
k+1+ |log(k)| for allk EN. 


Here nM: N — N is the enumeration of natural numbers excluding the Mersenne 
numbers 2” — 1 for n € N; the list of initial values is 2,4,5,6,8,9, 10,11, 12,13, 
14, 16,17 (http: //oeis.org/A062289). In Theorem 14.4] it is shown that 
VDM(6) = 8 = nM(6)—1, extending this to an improved upper bound VDM < nM, 
((25| Theorem 14.6], where nM; : N - N can be defined as follows: nMy(k) := 
nM(k) for k € N with k 4 2” —n +41 for some n > 3, while nM) (2” —n +1) := 
nM(2" —n+1)—1= 2"; see Table[I] for initial values. 


Theorem 2.5 ((25]) For k € N holds VDM(k) < nMi(k) < nM(k). 


We conclude these preparations with a special property of FCH(k) (supporting 
our Conjecture [8.1] that FCH = $2), namely by Lemma [2.2] we have: 


Corollary 2.6 FCH(k) is even for allk EN. 


3 Some integer sequences 


We review the “Smarandache primitive numbers” S2(k) and the meta-Fibonacci 
sequences a2(k). We show in TheoremB.16] that $2 = 2a2 holds. 


3.1 Some preparations 


We define two general operations a +> Aa and a+> $a for sequences a. First the 
(standard) A-operator: 


Definition 3.1 Fora: I — Z, where I C Z is closed under increment, we define 
Aa: IZ by Aa(k) := a(k +1) — afk). 


So a is monotonically increasing iff Aa > 0, while a is strictly monotonically in- 
creasing iff Aa > 1. Sequences with exactly two different A-values, where one of 
these values is 0, play a special role for us, and we call them “d-Delta”, where d is 
the other value: 


Definition 3.2 A sequence a : No > Z is called d-Delta for d € Z \ {0}, af 
Aa(No) = {Aa(n) }neno = {0, 4}. 


While the A-operator determines the change to the next value, the plateau- 
operator determines subsequences of unchanging values: 


Definition 3.3 For a sequence a: N > Z which is non-stationary (for all i there 
is 7 >i with a; # a;) we define Pa: N — N (the “plateau operator”) by letting 
Ba(n) forn €N be the size of the n-th (maximal) plateau of equal values (maximal 
intervals of N where a is constant). 


So $ a(1) is the size of the first plateau, $8 a(2) the size of the second plateau, and 
so on; Vi EN: a(i) 4 a(t +1) iff Pa is the constant 1-function. For a d-Delta 
sequence a from a and the initial value a; we can reconstruct a. 


3.2. Smarandache primitive numbers 


The “Smarandache Primitive Numbers” were introduced in Unsolved Problem 
AT]: 


Definition 3.4 For k € No let So(k) be the smallest n € No such that 2" divides 
n!. Using ordg(n), n EN, for the maximal m € No such that 2™ divides n, we get 
that S2(k) for k € No is the smallest n € No such that k < $7", ordg(é). 


So S'9(0) = 0, and AS>(No) = {0, 2}. 


Example 3.5 $2(2) = S2(3) = 4, while S9(4) = 6, since ordg(1) = ordg(3) = 0, 
while ordg(2) = 1 and ord2(4) = 2. 


The following is well-known and easy to show (see Subsection II.1 in [II] for 
basic properties of S2(k)): 


Lemma 3.6 The sequence S2(1),S2(2),52(3),... is obtained from the sequence 
1,2,3,... of natural numbers, when each element n € N is repeated ordg(n) many 
times. 


Example 3.7 The numbers So(k) fork € {1,...,25} are 2,4, 4, 6,8, 8, 8, 10, 12, 12, 
14, 16, 16, 16, 16, 18, 20, 20, 22, 24, 24, 24, 26, 28,28. The corresponding OEIS-entry is 


http: //oeis. org/A007843)| (which has 1 as first element (index 0), instead of 0 


as we have it, and which we regard as more appropriate). 


Lemma 3.8 ([32]) Fork € N holds k +1 < So(k) =k+ O(log k). 


We give an independent proof for the lower bound in Lemma|6.2] while we sharpen 
the upper bound in Theorem For more number-theoretic properties of S2 see 
31). To understand the plateaus of S2, we need the ruler function: 


Definition 3.9 Let ru, := ordg(2n) EN forn EN. 


Example 3.10 The first 30 elements of run, are 1,2,1,3,1,2,1,4,1,2,1,3,1,2,1, 


5,1,2,1,3,1,2,1,4,1,2,1,3,1,2 (nttp: //oeis. org/A001511)). 


The plateaus of Sz are given by the ruler function: in Lemma[3.6]we determined 
the number of repetitions of values v € N as ord2(v), while for the plateaus we skip 
zero-repetitions, which happen at each odd number, and thus for the associated 
index n we have n = $ for even v, and the number of repetitions is ordg(v) = 
ord2(2n); we obtain 


Lemma 3.11 B(52(k))cen = (tUn)nen- 


3.3. Meta-Fibonacci sequences 


Started by [10| Page 145], various nested recursions for integer sequences have been 
studied. Often the focus in this field of “meta-Fibonacci sequences” is on “chaotic 
behaviour”, but we consider here only a well-behaved case (but in detail): 


Definition 3.12 In [26] the sequence az : No > NEI has been defined recursively 
via 


ag(k) = ag(k am a(k = 1)) + ag(k —-1- ag(k = 2)), 
while a2(k) :=k for k € {0,1}. 


The sequence a2 was introduced in [5] as F.: N + No, with F(k) = k —1 for 
k € {1,2} and the same recursion law, which yields F(k) = a2(k — 1) for k € N. 
Furthermore, using F’(1) = F’(2) = 1 as initial conditions does not change anything 


else, and this sequence is the OEIS entry http://oeis.org/A046699 


Example 3.13 Numerical values for a2(k) and k € {0,...,27}: 0,1, 2,2,3,4, 4,4, 
5, 6,6, 7, 8,8, 8, 8,9, 10, 10,11, 12,12,12,13,14,14,15. The first five recursive com- 
putations: 


1. a2 2) = a2 2—ai(1))+a2(1—a2 0)) =a2 2—1)+a2 1-0) = a2 1) +. ae(1) = 
14+1=2. 


2. ag 3) = a2 3 —a9(2)) + a2(2—ag 1)) =a2 3—2)+a2 2-1) = a2 1) + a2(1) = 
14+1=2. 


3. a2 4) = a2 4 — a9(3)) + ao(3— ag 2)) = a2 4—2)+a9 3-2) = a2 2) +a9(1) = 


4. Q2 5) =ag 5 — ag(4)) +.a2(4— ag 3)) =ag 5—3)+ a2 4—2) =a2 2) +.a9(2) = 


5. ag 6) = a2 6 — a2(5)) + a2(5—ag 4)) =a2 6—4)+a9 5—3) = a2 2) +. a2(2) = 


It is shown (in our notation): 


Lemma 3.14 ([5]) Fork € N andp := |logy(k+1)|: ag(k) = 2?-!+a9(k+1—2?). 


Lemma yields a fast computation of a2(k). Corollary 2.9, Equation (1)] 
determines the plateau sizes: 


Lemma 3.15 ({12]) a2 is a 1-Delta sequence with B(a2(k)) ken = ru. 


We can now show az = 452, which has been conjectured on the OEIS (http: //oeis. org/A007843, 


by Michel Marcus): 


Theorem 3.16 Vk € No: S2(k) =2- ag(k). 


Proof: By Lemma and Lemma[3.15] together with 52(0) = a2(0) = 0. 


hiding two parameters d € N, s € Z used in [26], which are d = 2, s = 0 in our case 


4 Recursions for Smarandache primitive numbers 


In Subsection [4.]] we introduce the sequence S45 via a recursive process, which di- 
rectly ties into our main application in Theorem [5.5] for constructing unsatisfiable 
hitting clause-sets with many full clauses. This recursive definition uses an index, 
which is studied in Subsection [4.2] The central helper function is the “slack”, stud- 
ied in Subsection [4.3] We then prove a meta-Fibonacci recursion in Theorem [4.15 
and obtain $5 = Sp in Theorem [4.17] 


4.1 A simple course-of-values recursion 
Definition 4.1 For k € No let 
1. S5(0) :=0, S$(1) := 2; and fork > 2: 
2. Sb(k) := 2-(k-i+1) for the minimali € {1,...,k—-1} with k-i+1 < SS(i). 


Note that the recursion step is well-defined (the i exists), since for i = k — 1 holds 
k-i+1 = 2, and S4(k —1) = 2 for k = 2, while for k > 3 holds S4(k — 1) = 
2-((k—1)—#’+1) > 2-((k-1)—((kK-1)—1)41) = 4. The condition “k—i+1 < S5(i)” 
is equivalent to k+1<i+5%(i). Some simple properties are that $5(k) is divisible 
by 2, S$(k) > 2 for k > 1, and $4(2) =4 and S$(k) > 4 for k > 2. 


Example 4.2 The computations for S5(k) for1<k< 10: 
1— 2 by recursion basis 

ee aa ees ee eee ee ee a ee 
5-3 9-824) S44 6S 43 
1+ %-4-94 1) S6:0+42 555 G2 
53 2-(5-241 


i) 


+4=6>6 (i=2 


+4=7>7 (i =3 


eK Ww 


> 27 =A +6=10>8 (i=4) 


z 


8+ 2-(8-4+1)=10;4+6=10>9 i=4) 


93 2-(9-441 


) 

) 

) 

)=8 
6—> 2-(6—341) =8; 

)=8 

) 

)=12;44+6=10>10 @ =4) 

1 


10 2-(10-5+41) =12;5+8=13>11 G=5). 


4.2 Analysing the index 
Definition 4.3 For k > 0 let ig(k) =k +1—- 2W EN. 
Simple properties (for all k > 0): 

1. $3(k) =2+(k —ig(k) +1). 

2. ig(0) = ig(1) = ig(2) = 1. 

3. Aig(k) = 0 AS)(k) =2 and Aig(k) =14 ASS(k) =0. 


Example 4.4 Numerical values of is(k) for k € {0,...,25} are, together with 
S(k), Sh(ig(k)), and the sum of first and third row minus k +1, which is denoted 
below by “slg(k)”: 

1,1,1,2,2,2,3, 4,4, 4,5,5,5,6, 7,8, 8, 8,9,9,9, 10, 11, 11, 11, 12. 

0,2, 4, 4,6,8, 8,8, 10, 12, 12, 14, 16, 16, 16, 16, 18, 20, 20, 22, 24, 24, 24, 26, 28, 28 
2,2,2,4,4,4,4,6, 6, 6,8, 8,8, 8,8, 10, 10, 10, 12, 12, 12, 12, 14, 14, 14, 16 
2,1, 0,2, 1,0, 0,2, 1,0,2,1,0,0, 0,2, 1,0, 2, 1,0, 0,2, 1,0,2. 


An alternative characterisation of is(k): 
Lemma 4.5 For k > 0: ig(k) is the minimal i € No with i+ S§(t) >k+1. 


Proof: The assertion follows by what has already been said above, plus the 
consideration of the corner cases: 0 + 55(0) = 0 < k+1 for all k > 0, while 
14+55(1)=3>k+1 fork <2. 


We obtain a method to prove lower bounds for $4(k): 

Corollary 4.6 For k,i € No with S$(4) >k—-i+1 holds S$(k) > 2(k —i+1). 
is(k) grows in steps of +1, while S$(k) grows in steps of +2: 

Lemma 4.7 AS5(k) € {0,2} and Aig(k) € {0,1} for all k E No. 

Proof: Proof via (simultaneous) induction on k: The assertions hold for k < 1, and 


so consider k; > 2. Now ig(k) is the minimal 7 € {1,...,k—1} with k+1 <i+5%(i), 
and due to AS$(i) > 0 for all i < k it follows Aig(k) € {0, 1}. 


We obtain a simple upper bound on is: 


Corollary 4.8 For k > 1 holds ig(k) < k and for k > 2 holds ig(k) <k—-1 


4.3. The “slack” 
An important helper function is the “slack” slg(k): 
Definition 4.9 For k € No let slg(k) := (ig(k) + S$(ig(k))) — (K +1) E No. 


So slg(0) = (1+ 2) — (0+ 1) = 2 and slg(1) = (14+ 2) —-(14+ 1) = 1. Directly from 
the definition follows: 


Lemma 4.10 For k > 0 holds S5(is(k)) = $.95(k) + sls(k). 
We can characterise the cases Aig(&) = 1 as the “slackless” k’s: 


Lemma 4.11 For k > 0: 
1. Aig(k) =1 slg(k) =0 & ASS(k) = 0. 


2. Aig(k) =0 © slg(k) > 1 6 ASS (k) = 2 


Proof: If slg(k) > 1, then Aig(k) = 0 by Lemma[4.5] while for slg(k) = 0 we get 
Aig(k) > 1. 


Thus the slack determines the growth of S}: 


Corollary 4.12 Fork >0 holds AS$(k) = 2+ min(slg(k), 1). 
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And plateaus of the slack happen only for slack zero, and from such a plateau 
the slack jumps to 2, and then is stepwise again decremented to zero: 


Corollary 4.13 For k > 0 holds: 
1. If slg(k) > 0, then slg(k + 1) = slg(k) — 1. 
2. If slg(k) = 0, then slg(k + 1) € {0,2}. 


4.4 A meta-Fibonacci recursion 


We are ready to prove an interesting nested recursion for $4. First a combinatorial 
lemma, just exploiting the fact that the shape of the slack repeats the following 
pattern (Corollary (4.13): a plateau of zeros, followed by a jump to 2 and a stepwise 
decrement to 0 again (where right at & = 0 we start with slg(0) = 2): 


Lemma 4.14 For k > 2 holds 3~7_, slg(k — i) = 307_, i- min(1, slg(k — 4)). 
Proof: There are 0 < p< 2 and 1 < q <3 such that the left-hand side is 
pap Wars La eee ee (2 = 1) per ag 


for p = 0 the initial part is empty, for q = 3 the final part is empty. Let r > 0 
be the number of zeros; so r = 0 iff p = 2 (and then also gq = 3). We have 
ptrt+(2—q4+1)=2,ie, p+r+1=dq. Now the right-hand side is 


Lae aap ee pl eee Og gs Le eee ee, 


and we see that both sides are equal. 


Theorem 4.15 For k > 2 holds 


S2(k) = J S2(is(k — ) 


(note that by Lemma[Z.8| holds ig(k — i) < k). 
Proof: By Lemma[4.10]and Lemma [4.14] holds 


» S3(is(k —4)) = (DJ sls(k — 4) + $9(k) — 5 (840) — S3(k —%)) = 
$5(k) + (> i+ min(1,sls(k —#))) — ey. ASS(k +i), 
i=1 i=1 j=0 


where now by Corollary £12] holds 377_, S329 AS$(k + i — j) = (AS%(k — 1)) + 
(ASS(k — 2) + AS%(k —1)) = 30?_, ¢- ASS(k — 1) = 207_, ¢- min(1,slg(k)), which 
completes the proof. 


Now we see that S} is basically the same as az (recall Subsection [3.3): 


Corollary 4.16 Vk € No : 55(k) =2- aa(k). 
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Proof: For the purpose of the proof let a2(k) := $.95(k) for k € No. So we get 
ag(k) =k for k € {0,1}, while is(k) = k + 1 — ag(k), and thus for k > 2: 


2 2 


ax(k) = 5 94(h) = 5 2 Shlis(k —4)) = Y aalis(k - 8) = 


i=l i=1 


S> ao(k —i+1—aa(k — 4), 


i=l 


and so the assertion follows by the equations of Definition [3.12] 
We obtain the main result of this section: 


Theorem 4.17 55 = S2 (recall Definition|3.4). 


Proof: By Corollary 4.16]and Theorem [3.16] 


5 On the number of full clauses 


First we review full subsumption resolution, CU{v}, CU{t} ~ C, and its inversion, 
called “extension” in Section[5.1] where some care is needed, since we need complete 
control. From a clause-set F with “many” full clauses we can produce further 
clause-sets with “many” full clauses by full subsumption extension done in parallel, 
and this process of “full expansion” is presented in Definition [5.3] The recursive 
computation of Sz via Definition captures maximisation for this process, and so 
we can show in Theorem[5.5] that we can construct examples of unsatisfiable hitting 
clause-sets Fy, of deficiency k and with Sj(k) many full clauses. It follows that Sp 
yields a lower bound on FCH (Conjecture B.J]says this lower bound is actually an 
equality). 


5.1 Full subsumption resolution 


As studied in Section 6] in some detail: 


Definition 5.1 ([25]) A full subsumption resolution for F € CLS can be per- 
formed, if there is a clause C ¢ F with CU {v},CU {0} € F for some variable v, 
and replaces the two clauses CU{v},C U{t} by the single clause C. For the strict 
form, there must exist a third clause D € F\ {CU {v}, CU {v}} with v € var(D), 
while for the non-strict form there must NOT exist such a third clause. 


If F’ is obtained from F' by one full subsumption resolution, then c(F’) = c(F’) — 1; 
we have the strict form iff n(F’) = n(F), or, equivalently, 6(F’) = 6(F) — 1, while 
we have the non-strict form iff n(F’) = n(F) — 1, or, equivalently, 6(F") = 6(F). A 
very old transformation of a CNF (DNF) into an equivalent one uses the inverse of 
full subsumption resolution 


Definition 5.2 ([25]) A full subsumption extension for F € CLS and a clause 
C € F can be performed, if there is a variable v € VA\var(C) with CU{v}, CU{t} € 
F,, and replaces the single clause C by the two clauses CU {v},C U {t}. For the 
strict form we have v € var(F’), while for the non-strict form we have v ¢ var(F). 


2) Boole introduced in [4], Chapter 5, Proposition I, the general “expansion” f(v,Z) = (f(0,Z)A 


BD) V (f(1,Z) A v) for boolean functions f, where for our application f(v,#) & C. This was taken 
up by [28], and is often referred to as “Shannon expansion”. 
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If we consider F € MU and C € F, then we can always perform a non-strict full 
subsumption extension, while we can perform the strict form iff C’ is not full. If 
we denote the result by F’, then for F € UHIT we have again F’ € UHIT, but 
for general F € MU we might have F’ ¢ MU; see [25| Lemma 6.5] for an exact 
characterisation. 


5.2 Full expansions 


We now perform full subsumption extensions in parallel to m full clauses of F, 
first using a non-strict extension, and then reusing the extension variable via strict 
extensions: 


Definition 5.3 For F € CLS andmeN, where fc(F) > m, a full m-expansion 
of F is some GE CLS obtained by 


1. choosing some F’ C FN A(var(F)) with c(F’) =m, 
2. choosing some v € VA \ var(F’) (the extension variable), 


3. and replacing the clauses C € F" in F by their full subsumption extension 
with v (recall Definition [5.Q). 


The choice of v in Definition 5.3]is irrelevant, while the choice of F’ might have 
an influence on further properties of G, but is irrelevant for our uses. The following 
basic properties all follow directly from the definition: 


Lemma 5.4 Consider the situation of Definition 


1. There is always a full m-expansion G (unique for any fixed F’, v). 


2. If F €UHIT, then G EUHIT. 
3. n(G) =n(F) +1, (G) =c(F) +m. 
4. 6G) =5(F)+m-1. 

5. fe(G) =2-m. 


We turn to the construction of unsatisfiable hitting clause-sets with many full 
clauses (for a given deficiency): 


Theorem 5.5 Fork €N we recursively construct Fy € UHTT5=x as follows: 
1. Fy := {{1}, {-1}}. 
2. For k > 2 let Fy, be a full ag(k)-expansion of Fi.(x)- 

Then we have fc(Fy,) = S2(k). Thus Vk © N: So(k) < FCH(k). 


Proof: If the construction is well-defined, then we get fc(F,) = 2-ag(k) = So(k) 
and 6(F,) = 6(Fis(ny) + a2(k) — 1 = ig(k) + a2(k) -1 =k for k > 2 by Lemma[5.4] 
(using Theorem [4.17] freely), while these two properties hold trivially for k = 1. 

It remains to show that 1 < ig(k) < k —1 and ag(k) < fe(Fig¢ny) for k > 2. 
The first statement follows by Corollary [4.8] while the second statement follows by 
Lemma [4.5] 
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6 Applications 
We start by sharpening the upper bound from LemmaB.8} 


Theorem 6.1 Fork € N holds S9(k) <nuM(k) <k +14 |logs(k)|. 
Proof: By Theorem[5.5]and Theorem 


We can also provide an independent proof of the lower bound of LemmaB.8} 


Lemma 6.2 Fork €N holds So(k) >k +1. 


Proof: We prove the assertion by induction. For k = 1 we have S2(1) = 2, so 
consider k > 2. We use Corollary[4.6] and so we need i € N with k+1 < 2(k—-i+1), 
i.e., i << 4+. So we choose i := | 44+] € N. We have i < k, and so we can apply the 
induction hypothesis to i: i+ S2(i) = [44] + S.([42]) > |44)4+ |4#)4+1= 
2,4 )+1> 2/42 -1)+1=k, and thus i+ $,(i) >k+1. 


When upper and lower bound coincide, then we know all four fundamental 
quantities; first we name the sets of deficiencies (recall Theorems [2.4] [2.5): 


Definition 6.3 SNM := {k EN: S2(k) = nM(k)}, SNMy := {k © N: S2(k) = 
nM,(k)}. 


By Sz < VDM < nM, < nM we get SVM C SNM, and: 


Theorem 6.4 For k € SNM, holds S2(k) = FCH(k) = FCM(k) = VDH(k) = 
VDM(k) = nMj(k). 


We prove now that the special deficiencies 2” — n,2" —n—1 (n > 1; note 
5(An) = 2” — n) considered in Lemmas 12.10, 12.11], where we have shown 
that for them the four fundamental quantities coincide, are indeed in SNM, and 
that furthermore the special deficiencies 2” —n +1 (n > 3), where nM, differs from 
nM, are in SNM: 


Lemma 6.5 Consider n € N. 
1. $2(2" —n) = 2", and for k € No holds S2(k) = 2" 6 27 -n<k<2"-1. 
2.2” ne SNM, while 2” —n+1,...,2°-14¢ SNM. 
3. Assumen>2 now. Then 2” -—n—-—1€SNM with So(2" —n—1) = 2" — 2. 


4. For n> 3 holds 2° —-n+1€SNM,. 


Proof: By Corollary 7.24] we have nM(2” — n) = 2”, while nM(2” —n-—1) = 
2” — 2 (remember that the jumps for nM happens at the deficiencies 2" — n). Thus 
S9(2” — n) < 2” and 52(2” —n—1) < 2” — 2. Since for the value 2” the sequence 
Sy has a plateau of length n (Lemmal[B.6), while nM is strictly increasing, for Parts 
it remains to show S2(2” —n) > 2". We show this by induction: For 
n = 1 we have $9(1) = 2 = 2!, while for n > 2 by induction hypothesis we have 
(2" —n) — (2-1 —(n—1)) +1 = 2"71! < $,(2"-1 — (n—1)), thus by Corollary 4) 
S_(2"—n) > 2-2-1 = 2”. Finally, for Part] we note $2(2"—n+1) = So(n) = 2” 
by Part[I] while nM,(k) differs from nM(k) exactly at the positions k = 2” —n+1 
for n > 3, where then nMj(k) = nM(k) — 1 = 2” ((25| Theorem 14.7]). 


So the lower bound of Lemma |[6.2]is sharp for infinitely many deficiencies: 


Corollary 6.6 We have So(k) =k+1 for allk =2"—-1,neEN. 
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7 Initial values of the four fundamental quantities 


The task of this penultimate section is to prove the values in Table [I] (in Theorem 
[7.3] of course, only the four fundamental quantities are open). 


k 1/2 3 4/5 6 7 8 9 10 11/12 13 
nM(k) 5 618 9 16 17 
nM, (k) 5 6(8 8 16 16 
VDM(k) 5 6/8 8 16 16 
VDH(k) 5 6/8 8 10 11 12 13 14/16 16 
FCM(k) 4 6/8 8 9 10 12 12 14[16 16 
FCH(k) 4 6([8 8 8 10 12 12 14/16 16 
Sk) 2l4 4 678 8 8 10 12 12 14/16 16 


Table 1: Values for the fundamental quantities for 1 < k < 18; in bold the columns 
not in SNMy,, while the vertical bars are left of the special deficiencies 2” — n, 
n> 2. 


Strengthening [25] Corollary 12.13], first we establish properties of F € MU such 
that the number of full clauses equals the min-var-degree, i.e., there is a variable 
which occurs only in the full clauses. We use var,ya(F’) := {vu € var(F) : vdp(v) = 
pvd(F)} for F € CLS with n(F’) > 0 (the set of variables with minimal degree). 
Furthermore we use DP,(F) (“DP-reduction”, also called “variable elimination” ; 
see [24] for more on this important operation) for F € CLS and v € var(F) for 
the result of replacing the clauses containing variable v by their “resolvents” on v, 
which for clauses C,D € F with v € C, 0 € D is (C \ {v}) U(D \ {B}), and is only 
defined in case C, D do not have other clashes. Indeed the special use in Lemma 
[71] yields the inverse of the expansion process from Definition 


Lemma 7.1 Consider F € MU with fc(F) = uvd(F) (and thus n(F) > 0). 
1. vatyva(F) is the set of all v € var(F) which occur only in full clauses of F. 
2. fe(F’) is even. 


3. For v € vatyva(F) and F’ := DP,(F) we have F’ € MUs_ sp) 220) 41° 


4. fe(F) < 2-FOM(6(F) — 2 +1). 


Proof: Consider v € var(F’) with vdr(v) = uvd(F). The occurrences of v are 
now exactly in the full clauses of F (Part [I). Every full clauses must be resolvable 
on v, and thus the full clauses of F’ can be partitioned into pairs {v}UC, {GT} UC 
for ee) many clauses C. This shows Part [2] Parts [3] 4] now follow by considering 
F’ := DP,(F): F” is obtained by replacing the full clauses of F' by the clauses C 
(i.e., performs a full subsumption resolution, which are all strict except of the last 
one, which is non-strict). The new clauses C are full in F’ (though there might be 


other full clauses in F’). Obviously F’ € MU and 6(F’) = 6(F) — £9 +1. 


For deficiency k = 7 we have the first case of FCH(k) < FCM(k): 


Lemma 7.2 FCM(7) = 9 = nM(7) —1, while FCH(7) = 8 = S2(7). 


Proof: By S2(7) = 8 we have FCH(7) > 8. By Lemma [7.1] Part [4] and by 
FCM(3) = 4 the assumption of FCM(7) = 10 = nM(7) yields the contradiction 
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10 < 2FCM(7—5+1) = 2-4 =8, and thus FCM(7) < 9. By Lemma[2.2]we obtain 
FCH(7) = 8. A clause-set F € MUs=7 with fc(F’) = 9 (and n(F’) = 4) is given by 
the following variable-clause-matrix (the clauses are the columns): 


foo 0 
+ + 0 
O+ + + 
t + 4 + O 
Let the variables be 1,...,4, as indices of the rows. Now setting variable 4 to false 


yields A3, where one non-strict subsumption resolution has been performed, while 
setting variable 4 to true followed by unit-clause propagation of {—3} yields Az. So 
both instantiations yield minimally unsatisfiable clause-sets, whence by Lemma 
3.15, Part 2] F e MUP 


We are ready to prove the final main result of this report: 


Theorem 7.3 Table[i] is correct. 


Proof: The values for 1 < k < 6 have been determined in [25] Section 14]. We 
observe that 1,2,4,5,6,9,11, 12,13 € SNMj, and thus by Theorem [6.4] nothing is 
to be done for these values, and only the deficiencies 7,8, 10 remain. 

By Lemma [7.1] Part 2] we get that FCH(8) = FCM(8) = 10 (since nM(8) = 
11 is odd), and also FCH(10) = FCM(10) = 12. By Lemma [7.2] it remains to 
provide unsatisfiable hitting clause-sets witnessing VDH(7) = 10, VDH(8) = 11 
and VDH(10) = 13. For deficiency 7 consider 


O + 4 4 
QO. — « aie te fos he 
-+++---+ 4+ 4+ 0 
- - - - + + + 0 


Fy = 


F; has 4 variables and 11 clauses, thus 6(F7) = 11 — 4 = 7; the hitting property is 
checked by visual inspection, and F7 is unsatisfiable due to 8-274 +2-279 +27? = 
4 + + + + = 1, while finally every row contains exactly one 0, and thus F7 is 
variable-regular of degree 10 = nM(7). 

Finally consider Aq with 6(A,) = 16 — 4 = 12 and pvd(A4) = 16: perform 
four strict full subsumption resolutions on variables 1,2,3,4, and obtain elements 
of UHTT of deficiency 11, 10,9,8 with min-var-degree 14, 13, 12,11. 


8 Conclusion and Outlook 


In this report we have improved the understanding of the four fundamental quanti- 
ties, by supplying the lower bound Sz < FCH. The recursion defining $4 sheds also 
light on Sz = S$, and we gained a deeper understanding of S2 = 2a2. Moreover we 
believe (based on further numerical results) 


Conjecture 8.1 Vk © N: S9(k) = FCH(k). 


This would indeed give an unexpected precise connection of combinatorial SAT 
theory and elementary number theory. On the upper bound side, by Conjectures 
12.1, 12.6 in (see Figure 1 there for a summary of the relations between the 
four fundamental quantities) we get: 


3) [25] Lemma 3.15] contains a technical correction over [23] Lemma 1). 
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Conjecture 8.2 Vk EN: nM(k) —1 < FCM(k) < VDM(k) = VDH(k). 


Recall that VDM(k) < nM(k); so we believe that three of the four fundamental 
quantities are very close to nM(k). This is in contrast to nM(k) — S(k) being 
unbounded, and indeed S3(k) = k +1 for infinitely many k (Corollary 6.6), while 
by Lemma [6.5] we also know S2(k) = nM(k) for infinitely many k, and thus 5S» 
oscillates between the linear function k+1 and the quasi-linear function nM(k). To 
eventually determine the four fundamental quantities (which, if our conjectures are 
true, boil down to VDM and FCM, while VDH = VDM and FCH = $3), detailed 
investigations like those in Section [f|need to be continued. 

As FCH(k) and S2(k) are closely related via (boolean) hitting clause-sets, via 
generalised (non-boolean) hitting-clause-sets (see for the basic theory) we 
can establish a close connection to the $,(k) for all prime numbers p in forthcoming 
work. Here S,,(k) is the smallest n € No such that p* divides n!, as introduced in 
[30) Unsolved Problem 49]. This generalisation to (finite) domain sizes (boolean = 
2) is also essential to realise the full power of the methods of this work, and to obtain 


applications to the field of |covering systems of the integers, where the relation to 


Boolean algebra was noticed in [3] (see for an introduction). 
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